5 research outputs found

    Machine Learning in Automated Text Categorization

    Full text link
    The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this problem is based on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of preclassified documents, the characteristics of the categories. The advantages of this approach over the knowledge engineering approach (consisting in the manual definition of a classifier by domain experts) are a very good effectiveness, considerable savings in terms of expert manpower, and straightforward portability to different domains. This survey discusses the main approaches to text categorization that fall within the machine learning paradigm. We will discuss in detail issues pertaining to three different problems, namely document representation, classifier construction, and classifier evaluation.Comment: Accepted for publication on ACM Computing Survey

    A Multimodal Approach to exploit similarity in documents

    No full text
    Automated document classification process extracts information with a systematic analysis of the content of documents. This is an active research field of growing importance due to the large amount of electronic documents produced in the world wide web and available thanks to diffused technologies including mobile ones. Several application areas benefit from automated document classification, including document archiving, invoice processing in business environments, press releases and research engines. Current tools classify or \u201dtag\u201d either text or images separately.In this paper we show how, by linking image and text-based contents together, a technology improves fundamental document management tasks like retrieving information from a database or automated documents. We present an investigation of a model of conceptual spaces for investigation using joint information sources from the text and the images forming complex documents. We present a formal model and the computable algorithms and the dataset from which we took a subset to make experiments and relative tests and results

    A Multimodal Approach to Exploit Similarity in Documents

    No full text

    Outcomes of Patients Presenting with Mild Acute Respiratory Distress Syndrome: Insights from the LUNG SAFE Study

    No full text
    WHAT WE ALREADY KNOW ABOUT THIS TOPIC: Hospital mortality in acute respiratory distress syndrome is approximately 40%, but mortality and trajectory in "mild" acute respiratory distress syndrome (classified only since 2012) are unknown, and many cases are not detected WHAT THIS ARTICLE TELLS US THAT IS NEW: Approximately 80% of cases of mild acute respiratory distress syndrome persist or worsen in the first week; in all cases, the mortality is substantial (30%) and is higher (37%) in those in whom the acute respiratory distress syndrome progresses BACKGROUND:: Patients with initial mild acute respiratory distress syndrome are often underrecognized and mistakenly considered to have low disease severity and favorable outcomes. They represent a relatively poorly characterized population that was only classified as having acute respiratory distress syndrome in the most recent definition. Our primary objective was to describe the natural course and the factors associated with worsening and mortality in this population. METHODS: This study analyzed patients from the international prospective Large Observational Study to Understand the Global Impact of Severe Acute Respiratory Failure (LUNG SAFE) who had initial mild acute respiratory distress syndrome in the first day of inclusion. This study defined three groups based on the evolution of severity in the first week: "worsening" if moderate or severe acute respiratory distress syndrome criteria were met, "persisting" if mild acute respiratory distress syndrome criteria were the most severe category, and "improving" if patients did not fulfill acute respiratory distress syndrome criteria any more from day 2. RESULTS: Among 580 patients with initial mild acute respiratory distress syndrome, 18% (103 of 580) continuously improved, 36% (210 of 580) had persisting mild acute respiratory distress syndrome, and 46% (267 of 580) worsened in the first week after acute respiratory distress syndrome onset. Global in-hospital mortality was 30% (172 of 576; specifically 10% [10 of 101], 30% [63 of 210], and 37% [99 of 265] for patients with improving, persisting, and worsening acute respiratory distress syndrome, respectively), and the median (interquartile range) duration of mechanical ventilation was 7 (4, 14) days (specifically 3 [2, 5], 7 [4, 14], and 11 [6, 18] days for patients with improving, persisting, and worsening acute respiratory distress syndrome, respectively). Admissions for trauma or pneumonia, higher nonpulmonary sequential organ failure assessment score, lower partial pressure of alveolar oxygen/fraction of inspired oxygen, and higher peak inspiratory pressure were independently associated with worsening. CONCLUSIONS: Most patients with initial mild acute respiratory distress syndrome continue to fulfill acute respiratory distress syndrome criteria in the first week, and nearly half worsen in severity. Their mortality is high, particularly in patients with worsening acute respiratory distress syndrome, emphasizing the need for close attention to this patient population
    corecore